Abstract

A centralized coordinated multipoint downlink joint transmission in a frequency division duplex 
system requires channel state information (CSI) to be fed back from the cell-edge users to their serving 
BS, and aggregated at the central coordination node for precoding, so that interference can be mitigated. 
The control signals comprising the CSI and the precoding weights can easily overwhelm the backhaul
resources. Relative thresholding has been proposed to alleviate the burden; however, this is at the cost of 
reduction in throughput. In this paper, we propose utilizing the long term channel statistics comprising 
of pathloss and shadow fading in the precoder design to model the statistical interference for the 
unknown CSI. In this regard, a successive second order cone programming (SSOCP) based precoder for 
maximizing the weighted sum rate is proposed. The accuracy of the solution obtained is bounded with 
the branch and bound technique. An alternative optimization framework via weighted mean square error 
minimization is also derived. Both these approaches provide an efficient solution close to the optimal,
and also achieve efficient backhauling, in the sense that the precoding weights are generated only for
the active links. For comparison, a stochastic approach based on particle swarm optimization is also 
considered. 
I. Introduction

In cellular coordinated multipoint (CoMP) transmission systems, the channel state information
(CSI) available at the transmitter plays an important role in harnessing the gains of joint
transmission CoMP. In the downlink, a group of base stations (BSs) coordinate to coherently
serve a group of users that are prone to interference [1]-[3]. To mitigate interference in a frequency
division duplex (FDD) system, the users need to estimate the CSI based on the downlink pilots
from the BSs, and then feed it back to their cooperating BSs (typically to the serving BS). In
a centralized architecture, the BSs forward the CSI to a central coordination node, where the 
CSI from various users are accumulated to form the aggregated channel matrix which is used 
to design a precoder for mitigating interference. In a decentralized architecture, the users need 
to share the CSI between the cooperating BSs to form the precoding weights. Sharing of CSI 
and the precoding weights between the BSs and the central coordination node occurs over the 
backhaul. This is typically a microwave or an optical fiber link. 

A. Previous work 

In an FDD system, the overhead of feeding back the CSI of all the cooperating BSs from the 
user could easily overwhelm the wireless radio interface and the backhaul resources, especially 
in a centralized architecture. In this regard, absolute and relative thresholding [4] was proposed
to limit the CSI feedback. In particular, relative thresholding is a process in which each user
feeds back only those links that fall within a threshold, say 5 dB, relative to its strongest BS.
This results in limited CSI being available for the precoder design. A user centric clustering is
performed in [5], which is similar to the relative thresholding performed in our work. However, in that work
the aim is finding the optimal tradeoff between total transmit power and sum backhaul capacity 
via power minimization. 

In DU, D3, linear precoding is considered, as it provides a good tradeoff between complexity 
and performance. With limited CSI, in DSl a linear zero forcing (ZF) precoder with suboptimal 
power allocation DSl is considered to achieve the backhaul signaling load reduction based on a 
physical (PHY) layer precoding and a medium access control (MAC) layer scheduling approach. 
Apart from the PHY and MAC layer approaches, a predefined constrained backhaul infrastructure 
can be included in the precoder design as in D3l- The ZF approach requires a well conditioned 
aggregated channel matrix at the central coordination node for channel inversion, which cannot be 
guaranteed with limited CSI. This poses constraints on how the users are selected/scheduled, and
it makes it harder to achieve efficient backhauling. In this paper, the term efficient backhauling
is used to denote the case where the number of precoding weights generated for the active links
is equal to the number of CSI coefficients correspondingly available for the active links at the
central coordination node. Note that, for example, the ZF approach can generate non-zero
precoding weights for non-cooperating BSs, which then need to be nulled to achieve
efficient backhauling. In a centralized architecture, if the central coordination node decides the
routing of user data, then this will be based on the precoding weights being generated for only
the active links, instead of making all the user data available at all the cooperating BSs.
Such an approach requires efficient backhauling. Another approach to minimize the backhaul
user data transfer is to jointly design the precoder and simultaneously minimize the user data
transfer in the backhaul based on the quality of service [10]. In [11], a stochastic precoder
based on particle swarm optimization (PSO) is designed to achieve efficient backhauling while
taking the limited feedback and limited backhaul into account. However, with suboptimal power
allocation and with increase in the problem size, the complexity of the algorithm increases as
pointed out in [11, Sec. 3.4].

Weighted sum rate maximization is a difficult non-convex problem llT^ . [[T3l . In this regard, 
different centralized successive convex approximation (SCA) methods are proposed in [IT4ll - [|T^ . 
In [fTSlI . a low complexity approximation with faster convergence rate is proposed for a downlink 
multicell multiple input single output (MISO) system. In [flbl . a different approximation is used 
for robust precoding with uncertainty in CSI at the transmitter. The precoders can also be designed 
via the mean square error (MSE) approach. In f[T^ . IfTTI . it was shown that minimizing the 
weighted sum mean square error (MSE) is equivalent to the weighted sum rate maximization, 
where the precoder, receive weight and the receiver MSE weights are alternately optimized. 
Whenever a central coordination node is not available, then lfT2ll . IfTSl can be used to implement 
the precoder in a decentralized fashion. Signaling strategies are considered in IfT^ . and also under 
imperfect channel conditions [fT9ll extending the result from ifTTIl . Also, in [l20l . a generalized 
mean square error criterion is used to arrive at a robust linear precoding solution that can handle 
backhaul constraints with CSI uncertainty. Similar to weighted sum rate maximization, a cross 
layer queue deviation minimization is considered in Ifldll where the queue states act as weights 
for the sum rate maximization, with a different approximation of the signal to interference plus 
noise ratio (SINR) constraint.

B. Contributions 

In this work, we focus on the design of the precoder in a centralized FDD system with 
the objective of maximizing the weighted sum rate of the users with perfect but limited CSI 
feedback, and also under limited backhauling. We use the algorithms developed for the full CSI 
case, but now we incorporate the limited CSI and the statistical model of interference. In this 
regard, we propose a conservative precoder design for any combination of user centric clustering 
with per-antenna power constraint, where the long term channel statistics is incorporated into 
the optimization problem for the missing links. Here, we model the statistical interference for 
the unknown CSI as the long term channel statistics in the interference terms for the user. The 
model is pessimistic in nature, as the Cauchy-Schwarz inequality is applied on the unknown 
parts that were separated from the total interference. The long term channel statistics are also
used for making the routing decisions for the user data in the backhaul as noted in [21].

In our work, we effectively solve the problem of designing a PHY layer precoder with limited 
information based on the approach in lfT4ll - lfT^ . where we extend the SCA framework to cope 
with incomplete CSI at the transmitter. Here we consider joint transmission CoMP while lfT4ll - llT^ 
focused on coordinated beamforming. In this regard, we incorporate the pessimistic interference 
model based on the long term channel statistics (pathloss and shadow fading) into the problem 
formulation. Alternatively we also include this statistical interference model in the minimization 
of the weighted MSB IfTTlI - IIT^ . We also use the long term channel statistics to determine the 
CSI feedback threshold. Our proposed pessimistic statistical interference modeling is different 
compared to the previous work dUl, ||6l, ifTll . [l22l where the unavailable CSI are modeled as 
zeros. The availability of the long term statistics at the coordination node is a valid assumption,
as they are available in the existing cellular standards, where the users feed back the received
signal strength, more popularly referred to as the received signal strength indicator (RSSI).
For example, this is required during handover procedures. In our setup, we consider relative 
thresholding, a variant of [HI, based on this average signal strength at the user. 

The main contributions of this work are listed as follows: 

• The long term channel statistics (pathloss and shadow fading), based on relative thresholding,
are modeled in the precoder design as part of the pessimistic statistical interference in the
SINR.

• We efficiently solve the precoder design problem with limited feedback and limited
backhauling, using successive second order cone programming (SSOCP). In particular, the
long term channel statistics are included as part of the SOC constraint, rather than either
treating the unknown coefficients as zeros or naively replacing the zeros with this
side information. We also achieve efficient backhauling, where the precoding weights are
generated only for those links whose CSI was reported.

• As an alternative to the SSOCP, we reformulate the problem via weighted MSE criterion 
similar to ifT^ . [fTTl . with the use of the proposed long term channel statistics in the variance 
of the received signal. The results show that it achieves the same weighted sum rate as that 
of the proposed SSOCP on average. The MSE reformulation requires a higher number of 
iterations than SSOCP to converge but each sub-problem is simple to solve. 

• We characterize the performance of the proposed precoder design using numerical bounds
with a variant of the branch and bound technique [13]. The proposed iterative SSOCP
algorithm is very close to the optimal provided by the branch and bound method.

• We numerically compare the performance of the proposed iterative algorithm to an existing 
stochastic algorithm under limited feedback and limited backhauling. In particular we 
consider PSO, as the overhead of book keeping of variables is far simpler compared to 
other stochastic algorithms such as ant colony optimization or evolutionary algorithms. It 
was found that the performance of the PSO is inferior, especially when the problem size is 
increased. 

The paper is organized as follows: the system model is introduced in Section II. In Section III, the
precoder designs based on SSOCP and MSE are derived. A brief description of PSO is presented,
and this section concludes with the branch and bound technique used to bound the performance
of SSOCP. Using the derived precoders, the simulation results are presented in Section IV in
terms of the effect of threshold, cell-edge signal to noise ratio (SNR), BS antennas and the
SSOCP bounds. Finally, Section V concludes the paper.

Notation: A scalar variable is denoted as $x$ while $X$ denotes a scalar constant. A vector and
a matrix are denoted as $\mathbf{x}$ and $\mathbf{X}$, respectively. A set is denoted in calligraphic font as $\mathcal{X}$ and
the cardinality of the set is $|\mathcal{X}|$. The elements of set $\mathcal{X}$ not in set $\mathcal{Y}$ are denoted as $\mathcal{X} \setminus \mathcal{Y}$. The
absolute value of $x \in \mathbb{C}$ is denoted as $|x|$ while the $p$-norm of a vector is denoted as $\|\cdot\|_p$.
The transpose and conjugate transpose of a vector $\mathbf{x}$ are denoted as $\mathbf{x}^T$ and $\mathbf{x}^H$, respectively. The
expectation operation on the random variable $X$ is denoted as $\mathbb{E}_X[X]$.


II. System Model 


Consider a homogeneous network cluster consisting of $|\mathcal{B}|$ BSs, each with $N_T$ antennas. The
BSs are coordinated to serve $|\mathcal{U}|$ single antenna cell-edge users. The signal received by the $u$th
user is $y_u$, and it consists of the desired signal and intracluster interference

$$y_u = \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} x_u + \sum_{i \neq u} \sum_{b \in \mathcal{B}_i} \mathbf{h}_{b,u} \mathbf{w}_{b,i} x_i + n_u,$$ (1)

where $\mathcal{B}_u$ is the set of BSs from which the $u$th user is served. In this model, the intercluster
interference is considered to be negligible for the cell-edge users located at the cluster center, and
therefore it is not accounted for in (1). The channel experienced by the $u$th user from the $b$th BS with
$N_T$ antennas is $\mathbf{h}_{b,u} \in \mathbb{C}^{1 \times N_T}$. The precoding weight for the $u$th user with normalized data $x_u$ from
the $b$th BS with $N_T$ antennas is $\mathbf{w}_{b,u} \in \mathbb{C}^{N_T \times 1}$, such that $\mathbf{w}_{b,u} = [w_{b,u}^{(1)}, \ldots, w_{b,u}^{(k)}, \ldots, w_{b,u}^{(N_T)}]^T$,
where $w_{b,u}^{(k)}$ is the precoding weight on the $k$th antenna of the $b$th BS for the $u$th user, and $n_u$ is
the receiver noise at the $u$th user with power $N_0$.

To incorporate the long term channel statistics, let us first consider the SINR evaluated at the
central coordination node for the $u$th user as

$$\gamma_u = \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\sum_{i \neq u} \left| \sum_{b \in \mathcal{B}_i} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + N_0} = \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\sum_{i \neq u} \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} + \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + N_0},$$ (2)

where the interference terms in the denominator of (2) are split based on relative thresholding,
i.e., the set $\mathcal{B}_i \cap \mathcal{B}_u$ denotes the set of BSs that are involved in serving both the $u$th and the $i$th
user, as the CSI $\mathbf{h}_{b,u}$ falls within the relative threshold window. However, those links that fall
outside this threshold constitute the terms $\mathbf{h}_{b,u}$ with $b \in \mathcal{B}_i \setminus \mathcal{B}_u$, where $\mathcal{B}_i \setminus \mathcal{B}_u$ is the set of BSs serving the $i$th user
but not the $u$th user. The given set $\mathcal{B}_u$ is defined by the relative thresholding algorithm based
on the long term channel statistics as summarized in Algorithm 1. To achieve the condition of
efficient backhauling, the precoding weights are generated only for those links for which the
users have fed back the CSI.


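Algorithm 1 Relative thresholding performed at the user based on the long term channel statistics
(pathloss and shadow fading)

1: Set the feedback threshold, $T$ (= 3 dB, for example)
2: for $\forall u \in \mathcal{U}$ do
3:   Perform channel measurements of the BSs, $\mathcal{B}$
4:   $c = \max_{b \in \mathcal{B}} \left( \mathbb{E}\left[ \|\mathbf{h}_{b,u}\|_2^2 \right] \right)$
5:   for $\forall b \in \mathcal{B}$ do
6:     if $\left( c_{\mathrm{dB}} - \left[ \mathbb{E}\left[ \|\mathbf{h}_{b,u}\|_2^2 \right] \right]_{\mathrm{dB}} \right) < T$ then
7:       Include $b$ in the set $\mathcal{B}_u$
8:     end if
9:   end for
10:  The $u$th user feeds back the CSI of the set of BSs in $\mathcal{B}_u$
11: end for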
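The thresholding rule of Algorithm 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `relative_threshold`, the dictionary input format and the numerical gains are hypothetical and do not come from the paper.

```python
import math


def to_db(gain):
    """Convert a linear power gain to decibels."""
    return 10.0 * math.log10(gain)


def relative_threshold(avg_gain, threshold_db=3.0):
    """Return the set of BSs whose long term gain is within threshold_db of the strongest.

    avg_gain maps a BS index b to the long term average gain E[||h_{b,u}||^2]
    (linear scale) measured by one user; this input format is an assumption.
    """
    c_db = max(to_db(g) for g in avg_gain.values())  # step 4: strongest BS
    # steps 5-9: keep BS b only if it falls inside the threshold window
    return {b for b, g in avg_gain.items() if c_db - to_db(g) < threshold_db}


# Illustrative long term gains seen by one user from three BSs.
gains_user = {0: 1.0e-9, 1: 6.0e-10, 2: 1.0e-10}
serving_set = relative_threshold(gains_user, threshold_db=3.0)
```

With these example numbers, BS 1 is about 2.2 dB below the strongest BS and is kept, while BS 2 is 10 dB below and is dropped, so the user would report CSI only for BSs 0 and 1.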


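Now we define a new SINR, $\bar{\gamma}_u$, in (3a), where we replace the unknown channel coefficients
with an expectation as in (3a)-(3e):

$$\bar{\gamma}_u = \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\sum_{i \neq u} \mathbb{E}_h \left[ \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} + \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 \right] + N_0}$$ (3a)

$$= \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + \mathbb{E}_h \left[ \left| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 \right] + 2 \Re \left\{ \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \, \mathbb{E}_h \left[ \left( \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right)^H \right] \right\} \right) + N_0}$$ (3b)

$$= \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + \mathbb{E}_h \left[ \left| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 \right] \right) + N_0}$$ (3c)

$$\geq \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + |\mathcal{B}_i \setminus \mathcal{B}_u| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \mathbb{E}_h \left[ \left| \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 \right] \right) + N_0}$$ (3d)

$$= \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + |\mathcal{B}_i \setminus \mathcal{B}_u| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \lambda_{b,u}^2 \|\mathbf{w}_{b,i}\|_2^2 \right) + N_0}.$$ (3e)

We obtain (3b) by expanding the terms as $|a + b|^2 = |a|^2 + |b|^2 + a b^H + b a^H = |a|^2 + |b|^2 + 2 \Re\{a b^H\}$
and taking the expectation inside. Here the Hermitian operator degenerates to the scalar case.
To obtain (3c), we focus on $\mathbb{E}_h[\mathbf{h}_{b,u}]$, where $\mathbf{h}_{b,u}$ consists of three random variables: the
pathloss, $l_{b,u}$, the shadow fading, $s_{b,u} \sim \ln\mathcal{N}(0, \sigma_{SF}^2)$, and the small scale fading, where the
small scale fading on the $k$th antenna is $f_{b,u}^{(k)} \sim \mathcal{CN}(0, 1)$ and
$\mathbf{f}_{b,u} \in \mathbb{C}^{1 \times N_T} = [f_{b,u}^{(1)}, f_{b,u}^{(2)}, \ldots, f_{b,u}^{(N_T)}]$. The
large scale fading and the small scale fading are independent random variables, and $\mathbb{E}_f[\mathbf{f}_{b,u}] = \mathbf{0}$,
therefore we have $\mathbb{E}_h[\mathbf{h}_{b,u}] = \mathbb{E}_{l,s,f}[l_{b,u} s_{b,u} \mathbf{f}_{b,u}] = \mathbb{E}_{l,s}[l_{b,u} s_{b,u}] \, \mathbb{E}_f[\mathbf{f}_{b,u}] = \mathbf{0}$,
so that the cross term in (3b) vanishes. The inequality in (3d) is obtained using the Cauchy-Schwarz inequality
$\left| \sum_j a_j b_j \right|^2 \leq \sum_j |a_j|^2 \sum_j |b_j|^2$,
where $a_j = \mathbf{h}_{b,u} \mathbf{w}_{b,i}$ and $b_j = 1, \forall j$. Finally, we obtain (3e) as follows,
$\mathbb{E}_h\left[ |\mathbf{h}_{b,u} \mathbf{w}_{b,i}|^2 \right] = \mathbf{w}_{b,i}^H \mathbb{E}_h\left[ \mathbf{h}_{b,u}^H \mathbf{h}_{b,u} \right] \mathbf{w}_{b,i} = \lambda_{b,u}^2 \|\mathbf{w}_{b,i}\|_2^2$,
where $\lambda_{b,u}^2$ is the long term channel statistics of $\mathbf{h}_{b,u}$, i.e.,
$\mathbb{E}_h\left[ \mathbf{h}_{b,u}^H \mathbf{h}_{b,u} \right] = \lambda_{b,u}^2 \mathbf{I}$. We assume that the RSSI is reported by the
user and the transmit antennas are uncorrelated. For correlated channels, covariance matrices
$\mathbb{E}_h\left[ \mathbf{h}_{b,u}^H \mathbf{h}_{b,u} \right] = \mathbf{C}$ can be incorporated in the problem formulation. Finally, the weighted sum
rate of the $|\mathcal{U}|$ users is evaluated as

$$R_{tot} = \sum_{u} \alpha_u \log_2 (1 + \gamma_u) \ \text{[bps/Hz]},$$ (4)

where $\alpha_u$ is a non-negative weight of the $u$th user.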
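To make the pessimistic SINR of (3e) and the weighted sum rate of (4) concrete, the following Python sketch evaluates both for a toy setup. The function names, the dictionary-based data layout and all numerical values are assumptions made for illustration only; they are not part of the paper.

```python
import math


def inner(h, w):
    """Scalar product h w of a row channel vector and a column precoder."""
    return sum(hk * wk for hk, wk in zip(h, w))


def pessimistic_sinr(u, serving, h, lam2, w, n0):
    """Evaluate the SINR of (3e) for user u.

    serving[i] : set B_i of BSs serving user i
    h[(b, u)]  : reported CSI (list of complex), only for b in B_u
    lam2[(b,u)]: long term gain lambda_{b,u}^2 for links that were not reported
    w[(b, i)]  : precoding weights (list of complex) from BS b to user i
    """
    signal = abs(sum(inner(h[(b, u)], w[(b, u)]) for b in serving[u])) ** 2
    interference = 0.0
    for i in serving:
        if i == u:
            continue
        known = serving[i] & serving[u]    # CSI available at the coordination node
        unknown = serving[i] - serving[u]  # only long term statistics available
        interference += abs(sum(inner(h[(b, u)], w[(b, i)]) for b in known)) ** 2
        interference += len(unknown) * sum(
            lam2[(b, u)] * sum(abs(x) ** 2 for x in w[(b, i)]) for b in unknown
        )
    return signal / (interference + n0)


def weighted_sum_rate(sinrs, alphas):
    """Weighted sum rate of (4) in bps/Hz."""
    return sum(a * math.log2(1.0 + g) for a, g in zip(alphas, sinrs))


# Toy example: 2 single-antenna BSs, 2 users; user 1 did not report BS 0.
serving = {0: {0, 1}, 1: {1}}
h = {(0, 0): [1 + 0j], (1, 0): [0.5j], (1, 1): [1 + 0j]}
lam2 = {(0, 1): 0.04}
w = {(0, 0): [1 + 0j], (1, 0): [-1j], (1, 1): [1 + 0j]}
sinr0 = pessimistic_sinr(0, serving, h, lam2, w, n0=0.25)
sinr1 = pessimistic_sinr(1, serving, h, lam2, w, n0=0.25)
rate = weighted_sum_rate([sinr0, sinr1], [1.0, 1.0])
```

Setting `lam2` to zero in this sketch reproduces the model in which unknown links are treated as zeros, which can only increase the evaluated SINR; the non-zero long term term is what makes the design conservative.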

III. Precoder design 

Limited CSI at the central coordination node makes the design of the precoder all the more
difficult. In this section, we derive the precoders with limited CSI that also include the long term
channel statistics using the SINR definition from (3e).

A. Successive second order cone programming

We propose an SSOCP to solve the problem of precoder design with limited information.
SSOCP is based on SCA, which allows us to solve the problem efficiently with guaranteed
convergence. We adopt an optimization framework originally proposed in [14] for
linearizing a non-convex constraint that forms a constraint for the useful signal. We also adopt
the techniques in IfTSlI . [fT^ for handling the SINR, and reformulate them as SOC constraints. The
maximization of the weighted sum rate $R_{tot}$ with per-antenna power constraint¹ is formulated as

$$\begin{aligned} \underset{\mathbf{w}_{b,u}}{\text{maximize}} \quad & \prod_{u \in \mathcal{U}} (1 + \bar{\gamma}_u)^{\alpha_u} \\ \text{subject to} \quad & \sum_{u \in \mathcal{U}_b} \left| w_{b,u}^{(k)} \right|^2 \leq P_{max}, \ \forall b \in \mathcal{B}, \ k = 1, \ldots, N_T, \end{aligned}$$ (5)

where the logarithm, being a monotonically non-decreasing function, can be removed from the
objective, and $P_{max}$ is the maximum transmit power of an antenna of a BS serving a set of $\mathcal{U}_b$
users. This can be recast by letting $t_u = (1 + \bar{\gamma}_u)^{\alpha_u}$, where $\bar{\gamma}_u$ is from (3e), and adding a slack
variable $\beta_u$ as


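$$\underset{\mathbf{w}_{b,u}, t_u, \beta_u}{\text{maximize}} \quad \prod_{u \in \mathcal{U}} t_u$$ (6a)

$$\text{subject to} \quad \frac{\left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2}{\beta_u} \geq t_u^{1/\alpha_u} - 1, \ \forall u \in \mathcal{U},$$ (6b)

$$\sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + |\mathcal{B}_i \setminus \mathcal{B}_u| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \lambda_{b,u}^2 \|\mathbf{w}_{b,i}\|_2^2 \right) + N_0 \leq \beta_u,$$ (6c)

$$\sum_{u \in \mathcal{U}_b} \left| w_{b,u}^{(k)} \right|^2 \leq P_{max}, \ \forall b \in \mathcal{B}, \ k = 1, \ldots, N_T.$$ (6d)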

The LHS of (6b) is of the form quadratic over linear, which is a convex function, while $t_u^{1/\alpha_u}$
on the RHS is convex only when $0 < \alpha_u < 1$, and concave when $\alpha_u > 1$. Thus, the constraint is non-convex. A
concave approximation of the LHS can be obtained as in [14, (6b)], so we define the following
expressions

$$p_u = \Re \left\{ \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right\} \quad \text{and} \quad q_u = \Im \left\{ \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right\}.$$ (7)

¹ Over all data symbols for a given channel realization.
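By applying the first order Taylor expansion for $(p_u^2 + q_u^2)/\beta_u$ in the LHS of (6b) around the local point
$\tilde{p}_u, \tilde{q}_u, \tilde{\beta}_u$, $\forall u \in \mathcal{U}$, we get

$$2 \frac{\tilde{p}_u}{\tilde{\beta}_u} (p_u - \tilde{p}_u) + 2 \frac{\tilde{q}_u}{\tilde{\beta}_u} (q_u - \tilde{q}_u) + \frac{\tilde{p}_u^2 + \tilde{q}_u^2}{\tilde{\beta}_u} \left( 1 - \frac{\beta_u - \tilde{\beta}_u}{\tilde{\beta}_u} \right) + 1 \geq t_u^{1/\alpha_u}.$$ (8)

When $\alpha_u > 1$, $t_u^{1/\alpha_u}$ in the RHS of (8) is not convex, so it needs to be replaced by its upper
bound. Using the first order approximation at the point $\tilde{t}_u$, we obtain

$$t_u^{1/\alpha_u} \leq \tilde{t}_u^{1/\alpha_u} + \frac{1}{\alpha_u} \tilde{t}_u^{1/\alpha_u - 1} (t_u - \tilde{t}_u).$$ (9)

Therefore, combining (8) with (9), we get

$$2 \frac{\tilde{p}_u}{\tilde{\beta}_u} (p_u - \tilde{p}_u) + 2 \frac{\tilde{q}_u}{\tilde{\beta}_u} (q_u - \tilde{q}_u) + \frac{\tilde{p}_u^2 + \tilde{q}_u^2}{\tilde{\beta}_u} \left( 1 - \frac{\beta_u - \tilde{\beta}_u}{\tilde{\beta}_u} \right) + 1 \geq \tilde{t}_u^{1/\alpha_u} + \frac{1}{\alpha_u} \tilde{t}_u^{1/\alpha_u - 1} (t_u - \tilde{t}_u).$$ (10)

Now consider (6c), which can be rewritten as an SOC constraint [|T^

$$\left( \sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + |\mathcal{B}_i \setminus \mathcal{B}_u| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \lambda_{b,u}^2 \|\mathbf{w}_{b,i}\|_2^2 \right) + N_0 + \frac{1}{4} (\beta_u - 1)^2 \right)^{1/2} \leq \frac{1}{2} (\beta_u + 1).$$ (11)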
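The two first order approximations used in (8) and (9) can be checked numerically. The sketch below is a sanity check under stated assumptions: the function names are hypothetical, and the random sampling only illustrates that the linearization of the quadratic-over-linear term is a global lower bound and that the linearization of $t^{1/\alpha}$ with $\alpha > 1$ is a global upper bound.

```python
import random


def quad_over_lin(p, q, beta):
    """LHS of (6b) written with p and q from (7): (p^2 + q^2) / beta, convex for beta > 0."""
    return (p * p + q * q) / beta


def linearized(p, q, beta, p0, q0, beta0):
    """First order Taylor expansion of quad_over_lin around (p0, q0, beta0), cf. (8)."""
    f0 = quad_over_lin(p0, q0, beta0)
    return (2.0 * p0 / beta0 * (p - p0)
            + 2.0 * q0 / beta0 * (q - q0)
            + f0 * (1.0 - (beta - beta0) / beta0))


def root_upper(t, t0, alpha):
    """First order upper bound on t**(1/alpha) around t0, cf. (9); valid for alpha > 1."""
    return t0 ** (1.0 / alpha) + (1.0 / alpha) * t0 ** (1.0 / alpha - 1.0) * (t - t0)


rng = random.Random(1)
worst_gap_lhs = float("inf")  # smallest value of (true LHS - linearized LHS)
worst_gap_rhs = float("inf")  # smallest value of (upper bound - true RHS)
for _ in range(1000):
    p, q, p0, q0 = (rng.uniform(-2.0, 2.0) for _ in range(4))
    beta, beta0, t, t0 = (rng.uniform(0.1, 3.0) for _ in range(4))
    worst_gap_lhs = min(worst_gap_lhs,
                        quad_over_lin(p, q, beta) - linearized(p, q, beta, p0, q0, beta0))
    worst_gap_rhs = min(worst_gap_rhs, root_upper(t, t0, 2.0) - t ** 0.5)
```

Both gaps stay non-negative, which is why any point satisfying the approximated constraint also satisfies the original constraint (6b); this is the property that keeps every iterate of the successive approximation feasible.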




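Therefore, the reformulated convex problem for precoder design with the objective of maximizing
the geometric mean of $t_u$ becomes

$$\begin{aligned} \underset{\mathbf{w}_{b,u}, t_u, \beta_u}{\text{maximize}} \quad & \left( \prod_{u=1}^{|\mathcal{U}|} t_u \right)^{1/|\mathcal{U}|} \\ \text{subject to} \quad & \text{(6d), (10) and (11)}, \end{aligned}$$ (12)

where the geometric mean is concave, and the exponent does not change the optimal precoders. This
is performed merely to simplify the implementation. Also, the interfering terms can be collected
in a vector as

$$\mathbf{r}_u = \begin{bmatrix} \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \\ \sqrt{|\mathcal{B}_i \setminus \mathcal{B}_u|} \, \lambda_{b',u} \mathbf{w}_{b',i} \end{bmatrix}, \ b' \in \mathcal{B}_i \setminus \mathcal{B}_u, \ \forall i \neq u,$$ (13)

so that $\|\mathbf{r}_u\|_2^2$ equals the interference term in (6c).
The SSOCP with the above simplified notation is summarized in Algorithm 2.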
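The step from the interference constraint (6c) to a second order cone constraint relies on the identity $\|\mathbf{r}\|^2 + N_0 \leq \beta \Leftrightarrow \|[\mathbf{r}; \sqrt{N_0}; (\beta - 1)/2]\|_2 \leq (\beta + 1)/2$. The following sketch checks this equivalence on example values; the function names and numbers are hypothetical and chosen only for illustration.

```python
import math


def direct_holds(r, n0, beta):
    """Interference-plus-noise constraint in its original form: ||r||^2 + N0 <= beta."""
    return sum(abs(x) ** 2 for x in r) + n0 <= beta


def soc_holds(r, n0, beta):
    """Same constraint as a second order cone: ||[r; sqrt(N0); (beta-1)/2]|| <= (beta+1)/2."""
    lhs = math.sqrt(sum(abs(x) ** 2 for x in r) + n0 + 0.25 * (beta - 1.0) ** 2)
    return lhs <= 0.5 * (beta + 1.0)


# Example interference vector with ||r||^2 = 0.5 and noise power 0.1.
r_example = [0.3 + 0.4j, 0.5 + 0.0j]
checks = [(beta, direct_holds(r_example, 0.1, beta), soc_holds(r_example, 0.1, beta))
          for beta in (0.2, 0.5, 0.7, 1.0, 3.0)]
```

Squaring both sides of the cone form gives $\|\mathbf{r}\|^2 + N_0 + (\beta - 1)^2/4 \leq (\beta + 1)^2/4$, and the difference of the two squares is exactly $\beta$, so both tests agree for every value of $\beta$ in the list.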
Algorithm 2 SSOCP algorithm for precoder design

1: To avoid numerical instability, rescale the aggregated channel matrix and the noise power with a factor of the
   least pathloss such that the SINR is the same.
2: Set maxRetries = MAXRETRIES, see Fig. |^ for a possible choice.
3: while maxRetries do
4:   Randomly initialize the non-zero precoding weights, $\mathbf{w}_{b,u}$, from $\mathcal{CN}(0, 1)$, and ensure the power of each
     antenna is limited to $P_{max}$.
5:   Calculate $\bar{\gamma}_u$ based on (3e), $\forall u$.
6:   Set $n = 0$
7:   Evaluate $\tilde{p}_u^{(n)}$ and $\tilde{q}_u^{(n)}$ from (7).
8:   Evaluate $\tilde{t}_u^{(n)} = (1 + \bar{\gamma}_u)^{\alpha_u}$ and $\tilde{\beta}_u^{(n)}$, the interference plus noise term on the LHS of (6c).
9:   Set maxIter = MAXITER
10:  while maxIter AND not converged† do
11:    Treat $p_u$ and $q_u$ as expressions in CVX which will be used in (10).
12:    Solve the convex problem (12) as

       $$\begin{aligned} \underset{t_u, \beta_u, \mathbf{w}_{b,u}}{\text{maximize}} \quad & \text{geo\_mean}(t_u) \\ \text{subject to} \quad & \left\| \begin{bmatrix} \mathbf{r}_u \\ \sqrt{N_0} \\ \frac{1}{2} (\beta_u - 1) \end{bmatrix} \right\|_2 \leq \frac{1}{2} (\beta_u + 1), \ \text{(6d) and (10)}, \ \forall u \in \mathcal{U}. \end{aligned}$$

13:    Update: $\tilde{t}_u^{(n+1)} = t_u^{(n)}$, $\tilde{\beta}_u^{(n+1)} = \beta_u^{(n)}$
14:    Update: $\tilde{p}_u^{(n+1)} = p_u^{(n)}$, $\tilde{q}_u^{(n+1)} = q_u^{(n)}$
15:    Update: $n = n + 1$
16:    maxIter = maxIter − 1
17:    Evaluate and save the best weighted sum rate achieved so far, as well as the corresponding precoding
       weights.
18:  end while
19:  maxRetries = maxRetries − 1
20: end while

† The inner loop stops once the weighted sum rate no longer improves within a certain tolerance.
The weighted sum rate maximization is a non-convex problem, and the solution may end up
at an inefficient local optimum. In order to further improve the solution, we introduce random
initialization similar to [23], where we select the best solution out of a number of random
initializations. For a given aggregated channel matrix, a small increase in the number of random
initializations, as in step 2, increases the probability of finding a solution close to the global optimum.

The convergence of the proposed SSOCP algorithm closely follows the analysis carried out for
the full CSI case in [14]-[16]. Reformulating the SINR constraints to cope with incomplete CSI
does not affect the convergence of the SCA. The interference and noise terms are transformed
into an SOC from (6c), and the convex function in (6b) is approximated with a linear lower
bound at each iteration. This results in an SCA for every iteration, where the objective is
monotonically non-decreasing, thereby guaranteeing convergence. In the subsequent section, we
apply the branch and bound technique to show that the proposed SSOCP is very close to the
optimal.


B. Optimization via weighted mean square error minimization 


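Maximizing the weighted sum rate can be equivalently formulated as minimizing the weighted
sum MSE [12], [17]. In this section, we extend this result to the case of limited CSI and efficient
backhauling. We derive the precoder based on the weighted MSE formulation that accounts for
using long term channel statistics in the precoder design when there is limited CSI at the central
coordination node. Consider the received signal at the $u$th user as in (1). The estimated $u$th user
data at the receiver is $\hat{x}_u = a_u y_u$, where $a_u \in \mathbb{C}$ is the receiver weight. The MSE at the $u$th
receiver can be formulated as

$$\begin{aligned} \epsilon_u &= \mathbb{E}_{x_u, n_u} \left[ (\hat{x}_u - x_u) (\hat{x}_u - x_u)^H \right] \\ &= \mathbb{E}_{x_u, n_u} \left[ (x_u - a_u y_u) (x_u - a_u y_u)^H \right] \\ &= \mathbb{E}_{x_u} \left[ x_u x_u^H \right] - a_u^H \mathbb{E}_{x_u, n_u} \left[ x_u y_u^H \right] - a_u \mathbb{E}_{x_u, n_u} \left[ y_u x_u^H \right] + a_u a_u^H \mathbb{E}_{x_u, n_u} \left[ y_u y_u^H \right] \\ &= 1 - a_u^H \sum_{b \in \mathcal{B}_u} \mathbf{w}_{b,u}^H \mathbf{h}_{b,u}^H - a_u \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} + a_u a_u^H c_u, \end{aligned}$$ (14)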
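where $\mathbb{E}_{x_u} \left[ x_u x_u^H \right] = 1$ as the user data is zero mean with unit power and $c_u = \mathbb{E}_{x_u, n_u} \left[ y_u y_u^H \right]$
is the variance of the received signal which can be evaluated as

$$\begin{aligned} c_u = \mathbb{E}_{x_u, n_u} \left[ y_u y_u^H \right] &= \left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2 + \sum_{i \neq u} \left| \sum_{b \in \mathcal{B}_i} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + \mathbb{E}_{n_u} \left[ n_u n_u^H \right] \\ &= \sum_{\forall i} \left| \sum_{b \in \mathcal{B}_i} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + N_0, \end{aligned}$$ (15)

where $\mathbb{E}_{n_u} \left[ n_u n_u^H \right] = N_0$. Similar to (3), we split the interference terms to incorporate the
long term channel statistics for the unknown channel components, and bound the variance to be
pessimistic as

$$\bar{c}_u = N_0 + \left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2 + \sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + |\mathcal{B}_i \setminus \mathcal{B}_u| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \lambda_{b,u}^2 \|\mathbf{w}_{b,i}\|_2^2 \right).$$ (16)

The detailed steps are in the appendix. Therefore, the MSE in (14) has $c_u$ replaced with $\bar{c}_u$. To
find the optimal receive weight, $a_u^\star = \arg\min_{a_u} \epsilon_u$, we need to take $\nabla_{a_u} (\epsilon_u) = 0$, which works
out to be

$$a_u^\star = \sum_{b \in \mathcal{B}_u} (\mathbf{h}_{b,u} \mathbf{w}_{b,u})^H \, \bar{c}_u^{-1}.$$ (17)

Therefore, the MMSE is

$$\epsilon_u^{\mathrm{MMSE}} = 1 - \sum_{b \in \mathcal{B}_u} (\mathbf{h}_{b,u} \mathbf{w}_{b,u})^H \, \bar{c}_u^{-1} \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} = 1 - a_u^\star \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} = \frac{1}{1 + \bar{\gamma}_u}.$$ (18)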

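Thus, maximizing the weighted sum rate (4) can be formulated equivalently as a log(MSE)
minimization problem [12], [17] as

$$\begin{aligned} \underset{\mathbf{w}_{b,u}}{\text{minimize}} \quad & \sum_{u} \alpha_u \log_2 \epsilon_u^{\mathrm{MMSE}} \\ \text{subject to} \quad & \text{(6d)}, \end{aligned}$$ (19)

where $\alpha_u$ is a non-negative weight. The problem (19) is non-convex, so as a first step to find
a tractable local solution we introduce a new variable, $e_u$, to upper bound the MSE as well as
treat the receive scalar, $a_u$, as an optimization variable. The reformulated problem is

$$\underset{\mathbf{w}_{b,u}, a_u, e_u}{\text{minimize}} \quad \sum_{u} \alpha_u \log_2 e_u$$ (20a)

$$\text{subject to} \quad 1 - a_u^H \sum_{b \in \mathcal{B}_u} \mathbf{w}_{b,u}^H \mathbf{h}_{b,u}^H - a_u \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} + a_u a_u^H \bar{c}_u \leq e_u, \ \forall u,$$ (20b)

$$\text{(6d)}.$$ (20c)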


Still, the MSE constraint (20b) is not jointly convex with respect to both $\mathbf{w}_{b,u}$ and $a_u$. However,
for a fixed receiver, $a_u$, the constraint becomes convex. A successive linear approximation of
the concave objective is carried out at the point $e_u^{(n)}$ in the $n$th iteration as

$$\log_2 e_u \leq \log_2 e_u^{(n)} + d_u^{(n)} \left( e_u - e_u^{(n)} \right),$$ (21)

where $d_u^{(n)}$ is the linearizing coefficient, which is a non-negative MSE weight for the $n$th iteration,
and it works out to be

$$d_u^{(n)} = \frac{1}{e_u^{(n)} \ln 2}.$$ (22)

Substituting (21) in the objective (20a), and considering only those terms that depend on $\mathbf{w}_{b,u}$ for
a fixed $a_u$, while ignoring the constant terms and the iteration index in (21), the objective becomes

$$\underset{\mathbf{w}_{b,u}, e_u}{\text{minimize}} \quad \sum_{u} \alpha_u d_u e_u.$$ (23)

Furthermore, the MSE constraint is tight at the optimal solution. Thus, we replace the MSE
upper bound with the actual MSE expression as

$$\zeta \triangleq \sum_{u} \alpha_u d_u \epsilon_u$$ (24)

$$= \sum_{u} \alpha_u d_u \left( 1 - a_u^H \sum_{b \in \mathcal{B}_u} \mathbf{w}_{b,u}^H \mathbf{h}_{b,u}^H - a_u \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right) + \sum_{u} \alpha_u d_u a_u a_u^H \left( N_0 + \left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2 + \sum_{i \neq u} \left( \left| \sum_{b \in \mathcal{B}_i \cap \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \right|^2 + |\mathcal{B}_i \setminus \mathcal{B}_u| \sum_{b \in \mathcal{B}_i \setminus \mathcal{B}_u} \lambda_{b,u}^2 \|\mathbf{w}_{b,i}\|_2^2 \right) \right).$$ (25)

For fixed $a_u$, (20) can be solved via successive linearization until convergence. For fixed $\mathbf{w}_{b,u}$,
the optimal solution of (20) is given by the optimal receiver weight, $a_u$ in (17). This leads to
alternating optimization with monotonic convergence. In practice, we update $a_u$ and then $d_u$ just
once without sacrificing monotonicity of the objective. Thus, the original problem (20) can be
split as a 3-stage algorithm IfT^ . IfTTl . BUI, where the receiver weights, linearizing coefficients,
and the precoders are optimized in an alternating manner, i.e., (i) the receiver weights are updated
for a given precoder, (ii) the linearizing coefficients are updated for a given precoder, and (iii)
the precoders are evaluated for the given receiver weights and linearizing coefficients. When
compared to JUl, ifTTll . ifTSll . we design the precoder with limited information and achieve
efficient backhauling in a JT-CoMP scenario with per-antenna power constraint. The MSE based
precoder design with limited information is outlined in Algorithm 3. The convergence is evaluated
based on the MSE of each user as $-\sum_{u} \alpha_u \log_2 \epsilon_u$. This is a monotonically non-decreasing function,
and the algorithm is terminated when there is no further improvement.


Algorithm 3 The MSE approach for finding the precoder

1: Randomly initialize every precoding weight, $\mathbf{w}_{b,u}$, from $\mathcal{CN}(0, 1)$, apply the equivalent limited backhauling,
   and ensure the power of each antenna is limited to $P_{max}$.
2: while not converged do
3:   Evaluate $a_u$, $\bar{c}_u$, $\forall u$
4:   Solve the convex problem as

     $$\begin{aligned} \underset{\mathbf{w}_{b,u}}{\text{minimize}} \quad & \zeta \ \text{from (25)} \\ \text{subject to} \quad & \sum_{u \in \mathcal{U}_b} \left| w_{b,u}^{(k)} \right|^2 \leq P_{max}, \ \forall b \in \mathcal{B}, \ k = 1, \ldots, N_T. \end{aligned}$$

     Based on the precoding weights, steps 5-7 below are applied $\forall u$.
5:   Update the receiver variance, $\bar{c}_u$, based on (16)
6:   Update the receiver weight, $a_u$, based on (17)
7:   Update the linearizing coefficient, $d_u$, based on (22)
8: end while
9: return Precoding matrix


The problem of minimizing $\zeta$ can be solved either with generic solvers, such as those provided
with the CVX package [24], where the per-antenna power constraint is formulated as an SOC
program, or iteratively via the Karush-Kuhn-Tucker conditions as highlighted in [18,
Appx. A]. Note that the latter approach may be preferable when the number of power constraints
(corresponding to the dual variables) is relatively small. The MSE algorithm guarantees
convergence as shown in ITT^ Thm. 3]. The main difference to [|T^ is that here the receive
variance is affected by the long term channel statistics, thus the same convergence analysis
applies.

C. Stochastic optimization using particle swarm optimization 

Researchers modeling the movement of a flock of birds or a shoal of fish discovered that these
movements were in effect performing optimization. This gave birth to the entire field of swarm
intelligence. In particular, we focus on PSO, a stochastic optimization technique that can provide
an acceptable solution even when the problem is non-convex. PSO was proposed to design the
precoder in a CoMP setup with limited information [11]. We consider the implementation of
PSO as described in [11, Algo. 2], with the addition of random initialization, giving rise to a
multi-start PSO such that global optimization can be performed. PSO is a very attractive tool for
precoder design as it does not involve any matrix inversion, and the overhead of bookkeeping
the variables is very small compared to genetic algorithms. However, being heuristic
in nature, it does not guarantee optimality and the algorithm might not converge in polynomial
time with the increase in problem size.
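For readers unfamiliar with PSO, the following is a minimal sketch of a generic textbook global-best PSO applied to a toy cost function. It is not the precoder-design algorithm referenced above: the function name `pso_minimize`, the inertia and acceleration parameters, and the test objective are all assumptions chosen for illustration.

```python
import random


def pso_minimize(cost, dim, n_particles=20, iters=100, seed=0,
                 inertia=0.7, c_self=1.5, c_swarm=1.5):
    """Generic global-best PSO; returns (best_position, best_cost)."""
    rng = random.Random(seed)
    pos = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]        # personal best positions
    best_val = [cost(p) for p in pos]     # personal best costs
    g = min(range(n_particles), key=best_val.__getitem__)
    g_pos, g_val = best_pos[g][:], best_val[g]  # swarm (global) best
    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                # velocity: inertia + pull to personal best + pull to swarm best
                vel[i][d] = (inertia * vel[i][d]
                             + c_self * rng.random() * (best_pos[i][d] - pos[i][d])
                             + c_swarm * rng.random() * (g_pos[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            val = cost(pos[i])
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val


def shifted_sphere(x):
    """Toy objective with its minimum at (0.3, 0.3, ...)."""
    return sum((xi - 0.3) ** 2 for xi in x)


start_pos, start_val = pso_minimize(shifted_sphere, dim=4, iters=0)
final_pos, final_val = pso_minimize(shifted_sphere, dim=4, iters=100)
```

Only cost evaluations and a few vectors per particle are needed, which reflects the small bookkeeping overhead mentioned above; a multi-start variant would simply call `pso_minimize` with several seeds and keep the best result.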

D. Branch and Bound 

The SSOCP and the MSE reformulated approaehes are iterative algorithms where every 
sub-step is optimal and well justified, leading to monotonie improvement of the objeetive, 
with guaranteed eonvergenee. However, these approaehes even with a large number of random 
initializations is not guaranteed to obtain the optimal solution, as the problem is non-eonvex and 
NP-hard. Henee, we need to verify that our proposed solution is tightly bounded. In this regard, 
we eonsider [fTSll where braneh and bound (BB) is applied for weighted sum rate maximization 
in MISO downlink eellular networks. We reformulate this problem for joint transmission CoMP. 
This provides the lower and upper bounds for the problem with full and limited feedbaek, and 
also when the pessimistie statistieal interferenee model is used in the preeoder design. With this 
approaeh, we ean say how elose the proposed algorithm is from being globally optimum. 


17 


When limited CSI information is available at the eentral coordination node, the branch and 
bound technique can be applied via reformulating the weighted sum rate maximization in 
MISO downlink [fTSll to joint transmission CoMP networks. It is intuitive to observe that the 
reformulation is exactly the same as [|T^ however the SINRs are based on ( [^ . Instead of 
rewriting the whole branch and bound procedure as described in ifT^ . we highlight the main 
differences in the reformulated problem. The initialization of the hyperrectangle in [fT3l (9)] is 


Qinit = < 7|0 < 7u < \Su\ 

I 


/ Nq^uGU 


(26) 


The initial hyperrectangle Q ⊆ Q_init is described by γ_max ∈ R_+^{|U|} and γ_min ∈ R_+^{|U|}, with γ_min = 0_{|U|}. The 
upper limit in (26) can be obtained for each user by considering only the SNR, 
where the uth user is the only user in the system without any interference. Using the 
Cauchy-Schwarz inequality, we have 

  \left| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2 \le |\mathcal{B}_u| \sum_{b \in \mathcal{B}_u} \left| \mathbf{h}_{b,u} \mathbf{w}_{b,u} \right|^2 \le |\mathcal{B}_u| \sum_{b \in \mathcal{B}_u} \|\mathbf{h}_{b,u}\|_2^2 \|\mathbf{w}_{b,u}\|_2^2 \le |\mathcal{B}_u| N_T P_{\max} \sum_{b \in \mathcal{B}_u} \|\mathbf{h}_{b,u}\|_2^2.

The last inequality is due to the per-antenna 
power constraint. The other main contribution lies in the check for feasibility under the limited 
feedback and backhauling constraint. This is captured in Algorithm 4. The feasibility check is 
performed as part of the BB technique, and for completeness we provide a cookbook version of 
BB in Appendix B. 
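
As a numerical sanity check of the chain of inequalities behind the upper limit in (26), the following Python sketch draws random channels together with full-power precoders and compares the interference-free SNR against that limit. The function names, the unit values chosen for P_max and N_0, and the Gaussian channel draw are illustrative assumptions and are not taken from the simulation setup of this paper.

```python
import cmath
import random


def snr_upper_limit(h, n_t, p_max, n0):
    """Upper limit of (26): |B_u| * N_T * P_max * sum_b ||h_{b,u}||^2 / N_0.

    h holds one channel vector (a list of complex gains) per serving BS.
    """
    channel_gain = sum(abs(g) ** 2 for h_b in h for g in h_b)
    return len(h) * n_t * p_max * channel_gain / n0


def interference_free_snr(h, w, n0):
    """SNR of a lone user: |sum_b h_{b,u} w_{b,u}|^2 / N_0."""
    signal = sum(g * x for h_b, w_b in zip(h, w) for g, x in zip(h_b, w_b))
    return abs(signal) ** 2 / n0


def random_trial(rng, n_bs=3, n_t=3, p_max=1.0, n0=1.0):
    """Draw one (assumed Gaussian) channel and a full-power precoder."""
    h = [[complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n_t)]
         for _ in range(n_bs)]
    # Every antenna transmits at exactly P_max with a random phase,
    # which respects the per-antenna power constraint with equality.
    w = [[cmath.rect(p_max ** 0.5, rng.uniform(0, 2 * cmath.pi))
          for _ in range(n_t)] for _ in range(n_bs)]
    return interference_free_snr(h, w, n0), snr_upper_limit(h, n_t, p_max, n0)


rng = random.Random(0)
trials = [random_trial(rng) for _ in range(1000)]
print(all(snr <= limit for snr, limit in trials))
```

The limit is deliberately loose, which is acceptable here because it only sizes the initial hyperrectangle; the bisection step of the BB procedure tightens γ_max afterwards.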


Algorithm 4 Check if γ = [γ_1, ..., γ_u, ..., γ_|U|] is feasible. 

1: Check for feasibility by solving the convex problem 

     find        w_{b,u}, ∀b ∈ B_u, ∀u ∈ U, 
     subject to  SINR_u ≥ γ_u, ∀u ∈ U, expressed as a second order cone constraint.^1 

2: return feasibility, and save w_{b,u}, ∀b, u when feasible 

^1 The RHS of this SOCP formulation can be argued along the same lines as [13, Sec. IV.B]. 

IV. Simulations 

We consider |U| = 3 users that are uniformly dropped around the cell-edge at the intersection 
of |B| = 3 BSs, where each BS has N_T = 1, 3 transmit antenna(s) covering a cell-radius of 500 m. 
The cell-edge SNR is defined as the SNR experienced by one user at the cell-edge. For simplicity, 
we set α_u = 1, ∀u. The variance of the shadow fading component is σ_SF^2 = 8 dB. The receiver 
noise power is N_0 = kTB_n Watts, where k is the Boltzmann constant, 1.38×10^-23 Joules/Kelvin, 
T = 290 Kelvin is the operating temperature, and B_n = 10 MHz is the system bandwidth. 
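
For concreteness, the receiver noise power that follows from these values can be evaluated with the short Python sketch below. The helper names are illustrative; the constants are the ones quoted in the text.

```python
import math

BOLTZMANN = 1.38e-23  # Joules/Kelvin, value quoted in the text


def noise_power_watts(temperature_k=290.0, bandwidth_hz=10e6):
    """Thermal noise power N_0 = k * T * B_n in Watts."""
    return BOLTZMANN * temperature_k * bandwidth_hz


def watts_to_dbm(power_watts):
    """Convert a power in Watts to dBm (decibels relative to 1 mW)."""
    return 10.0 * math.log10(power_watts / 1e-3)


n0 = noise_power_watts()
print(n0, watts_to_dbm(n0))  # roughly 4.0e-14 W, i.e. about -104 dBm
```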

The legends in the following figures are summarized in Table I. They capture how 
much feedback or backhauling is required or used for a given relative threshold. 
The algorithms without subscripts, such as SSOCP, PSO, MSE, ZF, BB_UB and BB_LB, capture 
the case of full feedback and full backhauling, when the relative threshold is T = ∞ dB. The 
algorithms with subscripts such as SSOCP_{a,PL,0}, MSE_{a,PL,0}, BB_{a,UB,0} and BB_{a,LB,0} capture the 
case of limited feedback incorporating our proposed long term channel statistics, and with 
limited backhauling. The SSOCP_{PL,0} algorithm captures the naive approach of including the long 
term channel statistics, where they directly replace the missing channel coefficients in the 
interference terms of the SINR formulation, when there is limited feedback and limited backhauling. 
The algorithms with subscripts such as SSOCP_0, PSO_0, BB_{UB,0} and BB_{LB,0} capture the case of 
limited feedback and limited backhauling without the use of any side information. 

A. Effect of threshold and cell-edge SNR 

Fig. 1 shows the expected average weighted sum rate evaluated at the central coordination 
node when designing the precoder for various relative thresholds. With complete information, 
the performance of SSOCP, PSO and ZF is independent of the 
threshold. With limited information, PSO_0 and SSOCP_0 have similar performance. The most 
interesting curves are SSOCP_{a,PL,0} and SSOCP_{PL,0}, where SSOCP_{a,PL,0} incorporates the long 
term channel statistics in the interference terms, thereby resulting in a pessimistic precoder 
design. The SSOCP_{PL,0} naively replaces zeros with these long term channel statistics and appears 
to achieve superior performance when designing the precoder at the central coordination node. 
However, it is important to observe the actual performance of the precoder due to the transmission 
to the user. This is captured in Fig. 2, where SSOCP_{a,PL,0} outperforms all other cases when there 

Table I 
The interpretation of legends related to the information available and generated at the central 
coordination node is listed below. 

Legend†          CSI feedback                   Precoding weights 
BB_LB            Full                           Full 
BB_UB            Full                           Full 
BB_{a,LB,0}      Use long term stats            Limited 
BB_{a,UB,0}      Use long term stats            Limited 
BB_{LB,0}        Limited                        Limited 
BB_{UB,0}        Limited                        Limited 
MSE              Full                           Full 
MSE_{a,PL,0}     Use long term stats (25)       Limited 
PSO              Full                           Full 
PSO_0            Limited                        Limited 
SSOCP            Full                           Full 
SSOCP_{a,PL,0}   Use long term stats            Limited 
SSOCP_{PL,0}     Use long term stats directly   Limited 
SSOCP_0          Limited                        Limited 
ZF               Full                           Full 

†The acronyms in the legend are summarized here for convenience. BB_UB and BB_LB denote the upper and lower bound 
obtained from branch and bound as presented in Algorithm 5, PSO: particle swarm optimization, MSE: weighted mean square 
error, SSOCP: successive second order cone programming, and ZF: zero forcing. 

is limited information. It is interesting to note that the expected rates in Fig. 1 are in line with 
the actual rates in Fig. 2 for the proposed SSOCP_{a,PL,0} approach. 

Fig. 3 captures the effect of increasing the cell-edge SNR on the average weighted sum rate 
for a relative threshold of 9 dB. In the case of having limited information and long term channel 
statistics, SSOCP_{a,PL,0} can be beneficial compared to SSOCP_0. It is interesting to note that the naive 
approach SSOCP_{PL,0} only performs well at low thresholds (not shown here), and its performance 
deteriorates at high thresholds, and also with the increase in the cell-edge SNR. 

Fig. 4 shows the cumulative distribution function (CDF) of the weighted sum rate of the MSE 
and SSOCP based precoders. The MSE approach achieves a performance similar to that of the 
SSOCP. The MSE approach is attractive due to the simple sub-problems solved in 
every iteration. However, it takes longer to converge due to the receiver updates. 

Figure 1. The performance of the precoders in terms of the average weighted sum rate, R_tot, expected when designing the precoder 
at the central coordination node. The cell-edge SNR is 15 dB, N_T = 1, |B| = 3, and |U| = 3. 



Figure 2. The performance of the precoders in terms of the actual average weighted sum rate, R_tot, evaluated due to the 
transmission to the users, with increasing threshold for a given cell-edge SNR of 15 dB, N_T = 1, |B| = 3, and |U| = 3. 


B. Effect of number of BS antennas 

In this section, we investigate the effect of the number of BS antennas on the performance of 
the precoders. Fig. 5 shows the CDF of the weighted sum rate where the number of antennas is 

Figure 3. Average weighted sum rate versus cell-edge SNR for a relative threshold of 9 dB. 


increased to N_T = 3 at each of the 3 BSs, serving 9 users, resulting in a fully loaded system. For 
the non-SSOCP cases, the per-antenna power constraint is applied just as in the case of SSOCP. 
In addition, the whole precoding matrix is rescaled such that at least one of the antennas 
is transmitting at maximum power. Note that the per-antenna power constraint is more practical 
and that the SSOCP is more capable of utilizing this to the fullest extent. It can be observed that 
the proposed SSOCP outperforms all other precoding algorithms. However, the naive approach 
SSOCP_{PL,0} performs poorly. It is interesting to note that the SSOCP has consistent cell-edge 
performance with steeper CDF curves compared to the ZF approach. 
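
The rescaling applied to the non-SSOCP precoders can be illustrated with the minimal Python sketch below. It assumes the precoding matrix is stored with one row per transmit antenna and one column per user, so that the power of an antenna is the squared norm of its row; this layout and the function name are illustrative assumptions.

```python
def rescale_to_per_antenna_limit(precoder, p_max):
    """Scale the whole precoding matrix by one common factor so that the
    most loaded antenna transmits at exactly p_max.

    precoder: list of rows, one per transmit antenna; each row holds the
    complex weights of that antenna towards every user.
    """
    antenna_powers = [sum(abs(x) ** 2 for x in row) for row in precoder]
    peak = max(antenna_powers)
    if peak == 0.0:
        raise ValueError("all-zero precoder cannot be rescaled")
    scale = (p_max / peak) ** 0.5
    return [[scale * x for x in row] for row in precoder]


example = [[1 + 1j, 0.5], [0.2, 0.1j]]
scaled = rescale_to_per_antenna_limit(example, p_max=1.0)
print([sum(abs(x) ** 2 for x in row) for row in scaled])
```

Because a single scalar multiplies every entry, the beam directions and the power ratios between antennas are unchanged; only one antenna is guaranteed to reach the limit, which is why such a scaling cannot exploit the per-antenna constraint as fully as an optimization over individual weights.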

Fig. 6 captures the maximum rate that is achieved when designing the precoder at the central 
coordination node based on the number of random initializations of the precoder. Recall that 
increasing this number improves the chances of finding a solution close to the global optimum 
[23]. Each of the subplots in Fig. 6 shows the impact on the achievable rate of SSOCP and PSO 
with the increase in the problem size, due to the increase in the number of antennas at each 
of the BSs in a fully loaded system. With N_T = 1, the 3 BSs serve |U| = 3 users. To keep 
the system fully loaded, we consider |U| = 6 when N_T = 2, and |U| = 9 when N_T = 3. For 
limited feedback, choosing MAXRETRIES = 5 is good enough when one considers the tradeoff 
between the number of initializations and the achievable rate based on the available CSI at the 

Figure 4. Comparison of the CDF of SSOCP and MSE based precoders in terms of the weighted sum rate. It can be observed 
that the curves SSOCP and MSE overlap. The SSOCP_{a,PL,0} and MSE_{a,PL,0} curves closely overlap, when long term channel 
statistics are incorporated under the limited CSI and limited backhauling constraint. The cell-edge SNR is 15 dB and the threshold 
is 3 dB. The μ values in the legend show the average value. 


central coordination node. It is interesting to note that PSO performs close to SSOCP when 
the problem size is small; however, SSOCP consistently outperforms PSO as the 
problem size grows in terms of the number of transmit antennas and users. It should 
be noted that the power allocation with PSO is merely a scaling of the entire precoding matrix, 
as in dHll, and that with increased problem size, PSO is unable to completely make use of the 


per-antenna power constraint as in (6d). With N_T = 3, this behavior can be easily explained, 
and it can be concluded that the PSO is unable to converge in polynomial time. Due to this we 
do not consider PSO for any further analysis. 
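
The role of the number of random initializations discussed above can be sketched in a few lines of Python. The local solver here is a toy stand-in with two local maxima and is purely hypothetical; it is not the SSOCP or PSO routine, and the names `best_of_restarts`, `toy_local_solver` and `run` are illustrative.

```python
import random


def best_of_restarts(solve_from, draw_start, max_retries=5):
    """Run a local solver from several random starts and keep the best result."""
    best_value, best_point = float("-inf"), None
    for _ in range(max_retries):
        value, point = solve_from(draw_start())
        if value > best_value:
            best_value, best_point = value, point
    return best_value, best_point


# Toy stand-in for a local solver: it "converges" to the nearest of two
# local maxima, only one of which is the global optimum.
PEAKS = {-1.0: 1.0, 2.0: 3.0}  # location -> objective value


def toy_local_solver(x0):
    nearest = min(PEAKS, key=lambda peak: abs(peak - x0))
    return PEAKS[nearest], nearest


def run(max_retries, seed=1):
    """Best objective value found with a given number of restarts."""
    rng = random.Random(seed)
    return best_of_restarts(toy_local_solver,
                            lambda: rng.uniform(-3.0, 3.0), max_retries)


print(run(1)[0], run(5)[0], run(20)[0])
```

With a fixed seed the best value can only stay the same or improve as the number of restarts grows, at a cost that is linear in the number of local solves; this is the tradeoff behind the choice of MAXRETRIES.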


C. Bounding the proposed SSOCP 

In this section, we show that the accuracy of the solution obtained with SSOCP is tightly 
bounded. Here, we restrict our simulations to the SSOCP based approach, as we have already 
observed in Fig. 4 that the MSE and SSOCP have similar performance. Fig. 7 shows the 
convergence of the branch and bound algorithm under full/limited feedback and backhauling 
conditions. The y-axis captures the weighted sum rate obtained when designing the precoder 
at the central coordination node. Notice that the convergence is slow with (BB_UB, BB_LB) in 













Figure 5. With a cell-edge SNR of 15 dB and a threshold of 3 dB, |B| = 3 BSs with N_T = 3 antennas each are serving 9 users. 
The μ values in the legend show the average value. 

Figure 6. A comparison of the performance of PSO and SSOCP with the number of random initializations for a cell-edge 
SNR of 15 dB and a relative threshold of 3 dB. 


the topmost subplot, when compared to the convergence behavior of the other bounds. This is 
due to the size of the problem. Also notice that the numerical bounds tightly characterize the 
proposed SSOCP algorithm for precoder design. It should be mentioned that the bounds can be 
tightened depending on step in Algorithm The branch and bound technique is extremely 
slow in the CVX ||2^ environment even with the bisection method (Algorithm being applied 
to improve the lower bound. Likewise, the CDFs of the bounds are captured in Fig. 8 for full 

Figure 7. The convergence of the upper and lower bounds from the branch and bound procedure used to benchmark SSOCP 
and SSOCP_{a,PL,0}, for a given realization of the aggregated channel matrix with full/limited feedback and backhauling under 
|B| = 3, N_T = 1, |U| = 3, with a cell-edge SNR of 15 dB and a relative threshold of 3 dB. MAXRETRIES = 5, 20 for 
SSOCP and SSOCP_{a,PL,0}, respectively. 


and limited feedback and backhauling, with and without the use of long term statistics. It can 
be observed that the BB technique tightly bounds the proposed SSOCP. Note that when limited 
information is considered, the curves are those that were obtained during the precoder design 
at the coordination node. It can be observed that when the long term statistics are included, the 
precoder rate is more pessimistic. However, these precoders perform better during actual transmission, when 
the complete channel is considered, as seen in Fig. 2. 

V. Conclusions 

In this work, we have incorporated the long term channel statistics in the SSOCP algorithm 
to efficiently solve the precoder design when there is limited channel state information available 
at the central coordination node. Efficient backhauling is achieved, in a sense that the number 
of precoding weights generated for the active links is equal to the number of coefficients of the 
channel state information correspondingly available for the active links at the central coordination 
node. The above goals are accomplished with the objective of maximizing the weighted sum rate 
when jointly transmitting to a group of cell-edge users. The efficiency of the solution obtained 
with the proposed SSOCP algorithm is verified by the tight upper and the lower bounds. The 
performance of the precoder is also studied for various thresholds, cell-edge SNRs and with 

Figure 8. The plot shows the comparison of the CDF of the sum rate obtained when designing the precoder and their corresponding 
bounds. The μ values in the legend show the average value. Here MAXRETRIES = 20 in the algorithm for SSOCP_{a,PL,0}, 
whereas the default value of MAXRETRIES = 5 results in a performance slightly lower than the lower bound. 


the increase in the problem size. Alternatively, we also derived the weighted MSE approach 
for the precoder design that incorporates the long term channel statistics when there is limited 
information, and it was shown to achieve the performance of SSOCP. 


Appendix A 

The receive variance with limited CSI and long term channel statistics 


To obtain ( [T^ , consider ( [T5| ) 


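  C_u = \sum_{\forall i} \Big| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \Big|^2 + N_0 
      = \sum_{\forall i} \Big( \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \Big) \Big( \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \Big)^{H} + N_0 
      = \Big| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,u} \Big|^2 + \sum_{i \ne u} \Big| \sum_{b \in \mathcal{B}_u} \mathbf{h}_{b,u} \mathbf{w}_{b,i} \Big|^2 + N_0. 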

With limited information, we consider the expected value of the inactive links in the interference 
terms as 




<No + 


= iVn 


beSu 

^ ^ hf, .^Wji 
b&Bu 

b£Bu 


+ ETiT. 

i^u 

i^u 


bf^BiHBu b&Bi\Bu 

2 


y~! hf,^, 






beBi\Bu 


E 

i^u 


beBinB„ 


b&Bi\Bu 


Steps similar to (3a)-(3e) are applied here. 


Appendix B 

Cookbook version of the branch and bound that incorporates the long term channel statistics 


For completeness of Section III-D, a cookbook version of the branch and bound based on [13] 
is provided in Algorithm 5, with emphasis on including the long term channel statistics into the 
precoder design, where the SINRs are checked for feasibility in Algorithm 4. 

Algorithm 5 The branch and bound algorithm for bounding with limited information, applying the 
bisection method to improve γ_max, and returning the upper bound BB_UB and the lower 
bound BB_LB of the objective in Q. 

 1: Set tolerance ε = 0.1, maxiter = 100 
 2: Set Q_curr = Q_init 
 3: Algorithm 6: Bisection method to limit γ_max of Q_curr 
 4: Algorithm 4: Check if γ_min of Q_curr is feasible 
 5: Algorithm 7: Update BB_UB, BB_LB based on the above feasibility 
 6: Accumulate hyperrectangles and the corresponding bounds: 
      A = {(Q_curr, B_UB(Q_curr), B_LB(Q_curr))} 
 7: while BB_UB - BB_LB > ε AND maxiter do 
 8:   for a ∈ A do 
 9:     if BB_LB = a.B_LB then 
10:       Q_curr = a.Q 
11:       a_curr = a 
12:       break 
13:     end if 
14:   end for 
      // Branching 
15:   Split the longest edge of hyperrectangle Q_curr into Q_1 and Q_2 
16:   for i = 1, 2 do 
17:     Algorithm 6: Bisection method to limit γ_max of Q_i 
18:     Algorithm 4: Check if γ_min of Q_i is feasible 
19:     Algorithm 7: Update B_UB(Q_i) and B_LB(Q_i) 
20:   end for 
21:   Remove {a_curr} from A 
22:   Update A = A ∪ {(Q_1, B_UB(Q_1), B_LB(Q_1)), (Q_2, B_UB(Q_2), B_LB(Q_2))} 
      // Bounding 
23:   BB_UB = min_{a ∈ A} (a.B_UB) 
24:   BB_LB = min_{a ∈ A} (a.B_LB) 
25:   maxiter = maxiter - 1 
26: end while 
27: return BB_UB, BB_LB, W* from step 18 
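
The branch and bound loop can be sketched in Python on a toy problem. The sketch makes several simplifying assumptions that are not from the paper: the SOCP feasibility check is replaced by a hypothetical monotone test (`toy_feasible`), the weights are all one, the bisection tightening of γ_max is omitted, and the rate is maximized directly, so the global bounds are maxima over the stored boxes and the box with the largest upper bound is split. The sign convention may therefore differ from that of the listing.

```python
import math


def sum_rate(gamma):
    """Unit-weight sum rate in bits for a vector of SINR targets."""
    return sum(math.log2(1.0 + g) for g in gamma)


def toy_feasible(gamma, budget=2.0):
    """Hypothetical monotone stand-in for the SOCP feasibility check."""
    return sum(gamma) <= budget


def box_bounds(lower, upper, feasible):
    """Return (upper bound, lower bound) of the best rate inside one box."""
    if not feasible(lower):
        return 0.0, 0.0  # monotone test: no feasible point in this box
    return sum_rate(upper), sum_rate(lower)


def branch_and_bound(lower, upper, feasible, eps=0.1, maxiter=5000):
    # Each entry is (lower corner, upper corner, upper bound, lower bound).
    boxes = [(lower, upper) + box_bounds(lower, upper, feasible)]
    for _ in range(maxiter):
        best_ub = max(box[2] for box in boxes)
        best_lb = max(box[3] for box in boxes)
        if best_ub - best_lb <= eps:
            break
        # Branching: split the longest edge of the most promising box.
        idx = max(range(len(boxes)), key=lambda k: boxes[k][2])
        lo, hi, _, _ = boxes.pop(idx)
        edge = max(range(len(lo)), key=lambda j: hi[j] - lo[j])
        mid = 0.5 * (lo[edge] + hi[edge])
        hi_left = hi[:edge] + (mid,) + hi[edge + 1:]
        lo_right = lo[:edge] + (mid,) + lo[edge + 1:]
        # Bounding: evaluate both children and store them.
        for child_lo, child_hi in ((lo, hi_left), (lo_right, hi)):
            boxes.append((child_lo, child_hi)
                         + box_bounds(child_lo, child_hi, feasible))
    return max(box[2] for box in boxes), max(box[3] for box in boxes)


ub, lb = branch_and_bound((0.0, 0.0), (2.0, 2.0), toy_feasible)
print(round(lb, 3), round(ub, 3))
```

For this toy feasible set the optimum is γ = (1, 1) with a sum rate of 2 bits, so the returned bounds should bracket 2 to within the tolerance. Tightening γ_max by bisection before bounding, as in the listing, would shrink the upper bound of each box and reduce the number of splits.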

Algorithm 6 Bisection method to improve γ_max for the lower bound. 

 1: Set tolerance ε = 0.01 
 2: for each user do 
 3:   a = γ_min + (γ_{u,max} - γ_{u,min}) e_u, where e_u is the standard basis vector 
 4:   Set b_lower = γ_min and b_upper = a 
 5:   Algorithm 4: Check if a is feasible 
 6:   if feasible then 
 7:     γ_{u,max} = a 
 8:     continue step 2 // Avoid unnecessary bisection steps below 
 9:   end if 
10:   while ||b_upper - b_lower||_2 > ε do 
11:     t = (b_lower + b_upper)/2 
12:     Algorithm 4: Check if t is feasible 
13:     if feasible then 
14:       b_lower = t 
15:     else 
16:       b_upper = t 
17:     end if 
18:   end while 
19:   γ_{u,max} = b_upper(u) 
20: end for 
21: return γ*_max = [γ_{1,max}, ..., γ_{u,max}, ..., γ_{|U|,max}] 
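
A compact Python sketch of this per-user bisection is given below. Since b_lower and b_upper differ only in the uth coordinate, the vector norm in the stopping rule reduces to a scalar difference, which is what the code uses. The feasibility oracle is again a hypothetical monotone test rather than the SOCP check, and the function names are illustrative.

```python
def tighten_gamma_max(gamma_min, gamma_max, feasible, eps=0.01):
    """Shrink each entry of gamma_max by bisection along its own axis,
    keeping the other users at their gamma_min values."""
    tightened = list(gamma_max)
    for u in range(len(gamma_min)):
        corner = list(gamma_min)
        corner[u] = gamma_max[u]  # gamma_min + (gmax_u - gmin_u) * e_u
        if feasible(corner):
            continue  # already feasible: no bisection needed for this user
        lower, upper = gamma_min[u], gamma_max[u]
        while upper - lower > eps:
            trial = list(gamma_min)
            trial[u] = 0.5 * (lower + upper)
            if feasible(trial):
                lower = trial[u]
            else:
                upper = trial[u]
        tightened[u] = upper  # smallest value known to be infeasible
    return tightened


def toy_feasible(gamma, budget=2.0):
    """Hypothetical monotone stand-in for the SOCP feasibility check."""
    return sum(gamma) <= budget


result = tighten_gamma_max([0.5, 0.25], [4.0, 1.0], toy_feasible)
print(result)
```

In this toy case the first user's limit is pulled down from 4 to just above 1.75, the largest value that keeps the sum within the budget, while the second user's limit is left untouched because its corner point is already feasible.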


Algorithm 7 Update the bounds based on a given hyperrectangle Q_given and its feasibility. 

1: if feasible then 
2:   Set γ_u = γ_{u,min} 
3:   Evaluate B_U(Q_given) = R_tot 
4:   Set γ_u = γ_{u,max} 
5:   Evaluate B_L(Q_given) = R_tot 
6: else 
7:   Set B_U(Q_given) = 0 and B_L(Q_given) = 0 
8: end if 
9: return B_U(Q_given), B_L(Q_given) 